On the potential of glottal signatures for speaker recognition
نویسندگان
چکیده
Most of current speaker recognition systems are based on features extracted from the magnitude spectrum of speech. However the excitation signal produced by the glottis is expected to convey complementary relevant information about the speaker identity. This paper explores the use of two proposed glottal signatures, derived from the residual signal, for speaker identification. Experiments using these signatures are performed on both TIMIT and YOHO databases. Promising results are shown to outperform other approaches based on glottal features. Besides it is highlighted that the signatures can be used for text-independent speaker recognition and that only several seconds of voiced speech are sufficient for estimating them reliably.
منابع مشابه
Advances in Glottal Analysis and its Applications
From artificial voices in GPS to automatic systems of dictation, from voice-based identity verification to voice pathology detection, speech processing applications are nowadays omnipresent in our daily life. By offering solutions to companies seeking for efficiency enhancement with simultaneous cost saving, the market of speech technology is forecast to be particularly promising in the next ye...
متن کاملGlottal modeling and closed-phase analysis for speaker recognition
This paper concerns the application of glottal models and closed-phase analysis to the problem of speaker recognition. A glottal model based on one originally proposed by Fujisaki and Ljungqvist was used in conjunction with closed-phase analysis to yield features for a speaker recognition system used in the NIST 2003 Speaker Recognition Evaluation. Scores from the system based on the glottal mo...
متن کاملSpeaker Verification Using the Shape of the Glottal Excitation Function for Vowels
This paper seeks to establish a baseline for the potential contribution of the shape of the glottal source waveform to speaker recognition. A text-dependent speaker verification experiment was performed with 4 monosyllabic words spoken repeatedly by the 16 speakers of the TI46 speech data corpus. A single fundamental period was automatically extracted from each vowel centre and inverse-filtered...
متن کاملSpeaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation
A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...
متن کاملGlottal Waveforms for Speaker Inference & A Regression Score Post-Processing Method Applicable to General Classification Problems
Contributions are made along two main lines. Firstly a method is proposed for using a regression model to learn relationships within the scores of a machine learning classifier, which can then be applied to future classifier output for the purpose of improving recognition accuracy. The method is termed r-norm and strong empirical results are obtained from its application to several text-indepen...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010